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Abstract 

We consider the capacity problem for wireless networks. Networks are modeled as random unit-disk 
graphs, and the capacity problem is formulated as one of finding the maximum value of a multicom¬ 
modity flow. In this paper, we develop a proof technique based on which we are able to obtain a tight 
characterization of the solution to the linear program associated with the multiflow problem, to within 
constants independent of network size. We also use this proof method to analyze network capacity for a 
variety of transmitter/receiver architectures, for which we obtain some conclusive results. These results 
contain as a special case (and strengthen) those of Gupta and Kumar for random networks, for which 
a new derivation is provided using only elementary counting and discrete probability tools. 

I. Introduction 

A. The Capacity of Wireless Networks - Five Years Later- 

In March 2000 (exactly five years ago as of the writing of this paper), Gupta and Kumar published a 
landmark piece of work, where they presented a thorough study on the capacity of wireless networks [14]. 
For random networks, this problem was formulated as one of forming tessellations of a sphere, then 
defining routes in between cells, for which tight upper and lower bounds were obtained on their capacity. 
The main finding in [14] was actually a rather negative one: under a variety of very reasonable scenarios, 
in all cases the throughput available to each node in the network was found to be of the form ©(-4=) at 
most , for a network with n nodes - that is, this throughput becomes vanishingly small for large networks. 
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The results of [14] generated a flurry of activity in this area (surveyed below, in subsection I-D). 
However, five years later, although some progress has been made towards understanding the capacity 
of large networks in a regime in which the minimum distance among nodes remains fixed and the area 
covered grows unbound with the number of nodes, some questions related to the original setup in [14], 
dealing with high-density networks (meaning, networks with a growing number of nodes covering a fixed 
finite area) still remain, at best, only partially answered: 

• The ability to generate directed beams of energy in a wireless network could potentially change its 
behavior rather drastically, making the network “look like” a wired one. What exactly is the impact 
of directional antennas then on network capacity? 

• Despite some attempts, a pure information theoretic analysis on the capacity of high-density wireless 
networks still remains elusive. 

In this paper, we revisit the problem of capacity for random networks considered in [14]. We consider 
an entirely different problem formulation: our formalization of the network capacity problem consists of 
finding the value of a multicommodity flow problem defined on a random graph, for which we are able 
to obtain a number of results that contain those of [14] as a special case, generalizing them in a number 
of interesting directions. 


B. Problem Formulation 


Consider the following network communication problem, n nodes are uniformly distributed on the 
closed set [0,1] x [0,1], forming a random graph G = (V, E). Each node s % can only send messages to 
and receive messages from nodes within distance d n , where d n , in order for the graph to be connected 
with probability 1 (as n —> oo), has to satisfy 


2 _ log n + in 


TT dt = 


n 


( 1 ) 


for some £ n —> oo [13]. Source-destination pairs are formed randomly: for each source node one 
destination node t n is chosen by sampling uniformly (without replacement) from the set of network 
nodes (1 < i < n) - each node is both a source, a destination for some other node, and a relay for 
other nodes. All links have the same fixed finite capacity c, independent of network size. This scenario 
is illustrated in Fig. 1. 

Our goal in this paper is to determine the rate of growth of the maximum stable throughput (MST) 
for the network [30]—the rate at which all sources can inject packets, while maintaining stability for the 
system—and provided all sources inject data at the same rate. 
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Fig. 1. Problem setup, n randomly located transmitters send data to n randomly chosen receivers, all nodes act as 
sources/destinations/relays, and nodes can only exchange messages with nearby nodes (within range d n ). 


The problem of determining MST under a fairness constraint is an instance of a multicommodity flow 
problem [5, Ch. 29]: 

• There are n commodities: the packets available for transmission from transmitter s { ; to receiver t t . 

• The load on a single link contributed by all sources that use that link cannot exceed its capacity. 

• Subject to these constraints, we want to find the largest number of packets per unit of time that can 
be injected simultaneously by all sources. 

Representing our network by a graph G = (V,E), the capacity of an edge e = (u,v) by c(u,v), and 
letting our optimization variables be fi(u,v) (the flow along edge (u,v) for the i-th commodity), then 
the maximum multiflow problem above can be formulated as a linear program, as shown in Table I. 

TABLE I 

Linear programming formulation of the multicommodity flow problem with a fairness constraint. 


max An 


subject to: 


— S( Si ,n)gB /i( s 0 V )’ 

1 < i < n 

ELl fi(u,v) < c(u,v), 

(u, v) E E 

fi(U,v) = u), 

(u, v) € E, 1 < i < n 

Eng V fi( U ’ V ) = °> 

u £ V — {si,U}, 1 <i < n 


Our main task in this paper is to provide a characterization of the optimal value A*. Note that since 
the graph is random, and the LP is a function of the random graph, A* is a random variable itself. 

C. Asymptotically Tight Bounds 

Not much is known about the structure of optimal solutions to the maximum multiflow problem—the 
only technique we are aware of for deciding whether a particular amount of flow of each commodity 
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can be supported by the network consists of formulating this problem as a linear program, and then 
answering the non-emptyness question for its polytope of optimization using a standard LP solver (e.g., 
the Ellipsoid method [12]), or some of the efficient algorithms for maximum multiflow such as that of 
Karger and Plotkin [16]. Hence, we will not be able to use those formulations to do much more than 
obtain numerical results for our problem. We are thus motivated to search for an alternative formulation 
of the problem. And one such possible alternative is illustrated in Fig. 2. 



Fig. 2. In this formulation, we only consider the traffic generated by sources on the left-half of the network, with 
destination on the right-half—the traffic generated by all other source/destination pairs is discarded. 


Note that doing this amounts to introducing a restriction in the domain of optimization of the linear 
program from Table I: instead of considering all possible network flows, we only consider those which 
satisfy the constraints of Fig. 2. But what is crucial in this case is that, different from the problem of 
Fig. 1, this new problem involving flow going from the left to the right admits a regular single commodity 
flow formulation. The resulting linear program is shown in Table II. 

TABLE II 

Linear program for the single commodity flow problem. 


max nv n 


subject to: 


nvn = E„ g v/(s,w), 

u£V 

f{u,v) < c(u,v), 

( u , v) € E 

f{u,v) = 

( u, v) £ E 

E „ 6 y/(«T) = 0, 

u £ V — (s, t} 


The interest in this new linear program is due to the fact that, since it corresponds to a classical 
single commodity problem, we can try to solve it analytically using the max-flow/min-cut theorem [7], 
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However, the relationship between the optimal value A* for the “difficult” multicommodity problem, 
and no* for the “easier” single commodity problem, is not entirely straightforward. On one hand, the 
linear program in Table II is a restriction of that in Table I, since in the former some flow variables are 
constrained to 0 (that is how we incorporate the constraint of flow going only from left to right). On the 
other hand, the linear program in Table II is a generalization of that in Table I, since the latter removes 
the multicommodity constraints (all commodities are treated as a single commodity). Thus, an important 
question is that of giving a precise relationship between A* and u*. 

D. Related Work 

This work is primarily motivated by our struggle to understand the results of Gupta and Kumar on 
the capacity of wireless networks [14]. And the main idea behind our approach is simple: the transport 
capacity problem posed in [14], in the context of random networks, is essentially a throughput stability 
problem—the goal is to determine how much data can be injected by each node into the network while 
keeping the system stable—, and this throughput stability problem admits a very simple formulation in 
term of flow networks. Note also that because of the mechanism for generating source/destination pairs, 
all connections have the same average length (one half of one network diameter), and thus we do not 
need to deal with the bit-meters/sec metric considered in [14], 

As mentioned before, [14] sparked significant interest in these problems. Follow up results from the 
same group were reported in [15], [33]. Some information theoretic bounds for large-area networks were 
obtained in [19]. When nodes are allowed to move, assuming transmission delays proportional to the 
mixing time of the network, the total network throughput is 0(n), and therefore the network can carry 
a non-vanishing rate per node [11]. Using a linear programming formulation, non-asymptotic versions 
of the results in [14] are given in [29]; an extended version of that work can be found in [28]. An 
alternative method for deriving transport capacity was presented in [18]. The capacity of large Gaussian 
relay networks was found in [8]. Preliminary versions of our work based on network flows have appeared 
in [23], [24]; and network flow techniques have been proposed to study network capacity problems (cf., 
e.g., [1], [6, Ch. 14.10]), and network coding problems [17]. From the network coding literature, of 
particular relevance to this work is the work on multiple unicast sessions [20]. 

E. Main Contributions and Organization of the Paper 

Let A* denote an optimal solution to the linear program in Table I, and let v* denote an optimal 
solution to the linear program in Table II. Our first result consists of finding the asymptotic value of A* : 
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with probability 1 as n —> oo, 

0(a;) = @(K) = 0 • (2) 

This result formally establishes the equivalence between the two linear programs, in a well defined sense: 
they both have solutions which differ by at most a constant factor, independent of network size. 

A second important contribution is to show an application of the proof methods developed to estab¬ 
lish (2), to obtain the maximum stable throughput for various transmitter/receiver architectures: 

• We consider first the case of omnidirectional antennas, where we show that the scaling laws obtained 
based on our proof method are identical to those of [14]: per-node throughput is 0(-^=j===). 

• Then we apply the same proof techniques to the determination of scaling laws for a new architecture, 
in which transmitter nodes can generate a single and arbitrarily narrow directed beam, and in which 
receivers can successfully decode multiple transmissions as long as the transmitters are not co-linear. 
And in this case we find that: 

- If only enough power to maintain the network connected is radiated at each node, the maximum 
stable throughput of this network is 0 

- If now enough power is radiated to achieve MST linear in network size (certainly feasible with 
narrow beams), then the number of resolvable beams that each node must generate is 0(n). 

• Finally, we consider a node architecture in which each node is able to generate multiple and 
arbitrarily narrow directed beams, simultaneously to all nodes within its transmission range, and 
receivers operate as above. In this case we find that: 

- If only enough power to maintain the network connected is radiated at each node, the maximum 
stable throughput of this network is ©(logs n/y/n). 

- If now enough power is radiated to achieve MST linear in network size (certainly feasible with 
narrow beams), then the number of resolvable beams that each node must generate is ©(na). 

Essentially, our results show that both directional antennas, as well as the ability to communicate 
simultaneously with multiple nodes, can only achieve modest improvements in terms of achievable MST. 
While some performance gains are certainly feasible at reasonable complexities (in the order of a low- 
degree polynomial in log n), the number of resolvable beams that need to be generated to increase the 
achievable MST by more than a polylog factor is polynomial in network size, and thus exponential in 
the minimum number of beams required to keep the network connected. How many beams need to be 
resolved is a reasonable measure of complexity, since the higher this number, the narrower these beams 
need to be made, and hence the higher the complexity of a practical implementation. 
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We also believe another original contribution is given by our proof techniques: 

• Our results are obtained using only elementary network flow concepts, and the calculations involved 
require only basic probability theory, calculus and combinatorics. By formulating the problem of [14] 
as an elementary problem of flows in random graphs, we obtain what we believe is a number of 
interesting insights into the nature of this problem which were not obvious to us from their proof 
technique, as well as a set of meaningful generalizations to deal with the case of directional antennas. 

• Except for some elements of the protocol model considered in [14], most of the work we are aware 
of on this subject (e.g., [8], [11], [15], [19], [29], [33]), has focused on the use of “continuous” tools, 
dealing with Gaussian signals, power constrained channels, etc. Our work instead takes a “discrete” 
approach to the network capacity problem, tackling it primarily using flow, counting and discrete 
probability tools. Thus, we believe our proof technique, while using only elementary tools, has some 
novelty in the context of the problem considered here. 

An added benefit of our proof method is that we are able to prove strong convergence (meaning, 
convergence with probability 1) in all cases. In particular, when considering the specialization of our 
results to the setup of [14], our results are stronger, in that only weak convergence is established there. 

The rest of this paper is organized as follows. In Section II we formulate upper and lower bounds 
on the value of A*: the upper bound is evaluated in Section III. and the lower bound is evaluated in 
Section IV. Then, applications of these results in the context of wireless networking problems follow: 
in Section V we present an alternative derivation for the results of Gupta and Kumar in the context of 
random networks [14], and in Section VI these results are extended to deal with two different cases 
involving directional antennas. The paper concludes with Section VII. 

II. Asymptotically Tight Bounds on the Value of the Linear Program 

In this section we start with some preliminaries presenting the tools used to carry out our analysis, to 
then go on to formulate upper and lower bounds on the value of A*. 

A. Tools 

To compute the maximum value of a single commodity flow in our network, we use a standard result 
in flow networks: the max-flow/min-cut theorem of Ford and Fulkerson [7]. We solve this problem by 
counting how many edges can be constructed so that they all simultaneously straddle a minimum cut. 
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1) The Max-FIow/Min-Cut Theorem: f is a flow of maximum value iff \ f\ = c(S,T) (for some cut 
( S , T )). We focus our attention on one particular cut (shown also in Fig. 3): 


S = (xi,yi) e V n [0, i) x [0,1], 
T = ( Xi,yi ) G VC [|,1] x [0,1], 



Fig. 3. To illustrate the choice of a cut to derive bounds. L and R are sections of the network on each side of the 
cut boundary, of width d n , the transmission range. 


In this way, to compute the value of a maximum flow we need to determine how many edges straddle 
the x = 4 cut. To do that, we proceed in two steps. First, we compute the expected number of edges that 
straddle this cut, with this expectation taken as an ensemble average over all possible network realizations. 
Then, we derive a sharp concentration result: given an arbitrary network realization, with probability 1 
as n —* oo, we show that in this network, the actual number of edges that straddle the cut has the exact 
same rate of growth (in the 0 sense of [10]) as the ensemble mean does. 

2) Mean Values: What is the average number of nodes in a subset A C [0,1] x [0,1]? A simple 
calculation shows that 


E (Number of nodes in A) = nP(A) = n fxydA = n\A\, 

Ja 


( 3 ) 


where |A| denotes the area of A. 

3) Chemoff Bounds: In addition, to prove sharp concentration results, we need to bound the probability 
of deviations from its mean by sums of independent random variables: 

• Consider n points X\ ... X n iid and uniformly distributed on [0,1] x [0,1]. We have a number of 
subsets Aj c [0,1] x [0,1], for j = l.../(n) (the number of subsets may depend on the number of 
points n), and denote the area of any such subset by \Aj\. Now we define some random variables: 

1, X, t G Aj 

0, otherwise. 

Since the Xi s are independent, the IVy’s are also independent. 


Nij = 
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• Now let Nj be another random variable defining the number of points in A ;J , i.e., 

We see in this case that the Nj’ s, j = 1... f(n) are random variables where each is the sum of n 
iid binary random variables (but not necessarily independent among the Nj’ s themselves). 

• The expected number of points in Aj is 

( n \ n 

EM = E^)- 

i= 1 / i=l 

But, since P(2Q £ Aj) = |Aj|, we have that E(Nij) = l|Aj| + 0(1 — |Aj|) = |Aj|, and hence 

E(Nj) = n\Aj\. 

For the family of variables Nj, we have the following standard results, known as the Chernoff bounds 
(see, e.g., [22, Ch. 4]): 

1) For any 5 > 0: 

/ 6 

P[JV,>(l + <5)n|^|] < ( (1+f)W j • 

2) For any 0 < <5 < 1: 

P[Nj < (1 - 6)n\Aj\\ < e - l 2 n ^ s \ 

With a few simple calculations we can rewrite the first bound as 

e <5 \™\ A i\ 

g(l+<5) ln(l+i5) J 

= (t+<5) ln(l+<5)^ _ e (<5— ( t+<5) ln(l+5))n|Aj | = e ~0 in \^ 

where —9\ = 5 — (1 + 5) ln(l + <5). We can also rewrite the second bound as 
P[{Nj - n\Aj\) <-5n\Aj\\ < e (-| 52 )"l^l = 

where —02 — — |<5 2 . Consider now the case of 0 < <5 < 1: restricted to this range, we have that 9\ > 0; 
and 0 2 is clearly positive as well. Thus, by defining 9(5) = min(0i,0 2 ), we have 

P[ |Nj - n\Aj\\ > 5n\Aj\ ] < e~ 0n 1^1. (4) 

Our interest in (4) is because, if we can prove probability bounds of that form, then we can claim that 
^ = 0(| A ? |) with probability 1, in the limit as n —> oo. In other words, for the random variables Nj, as 
n —> oo, there exist constants such that deviations from their mean by more than these constants occur 
with probability 0. Note that as n —» oo, e~ dn ^ A ^ —* 0, so 

lim P\ |JV 7 - — n|A 7 -|| > <in|A 7 | 1 = 0, 

n— >oo L J J J J 


P[(Nj — n\Aj\) > 5n\Aj\] < ^ 
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or equivalently, 

1 = lim P\ liVj — n|A 7 || < (5n|A 7 -| 1 

n—>oo 1 J J J J 

= lim P\ 0 < (1 — 5)n\Aj\ < Nj < (1 + 5)n\Aj\ 1 

n—>oo 1 J 

= lim P\Nj = 0(n|A 7 |) 1, 

n—>oo 1 J J 

where 0 is defined in [10] as: 

0(g(n)) = {/(n) : 3ci > 0,C2 > 0,no, for which 0 < c\g{n) < f(n) < C2g(n), Mn > no-}. 

B. An Equivalent Linear Program 

As suggested in the Introduction, we will not work directly with the original linear program from 
Table I, but instead we will work with a new linear program, one in which flow is constrained to move 
from left to right. The new LP is formally stated in Table III. 

TABLE III 

Linear programming formulation of the multicommodity flow problem, in which the supply of connections other 

than those going from left to right are set to 0. 


max 

subject to: 

In 



= 5 Z(s i: v)eE fi( S i > 0! 

1 < i < n and 

Si £ S = {u <= V : u G [0, I) x [0,1]}, 

L € T = {u 6 V : u € [§, 1] X [0,1]}, 


E"=l fi{u,v) < c(u,v), 

(u, v) £ E 


IT 

cs 

II 

1 

(u, v) € E, 1 < i < n 


E„ 6 V M U ’ V ) = °. 

u £ V — (si, ti}, 1 < i < n 


fi(Si,v) = 0, 

Vs; € T = {u e V : u £ [i, 1] x [0,1]}, 
ti £ S = {u € V : u e [0, i) x [0,1]}. 


Considering sources on the left half of the network and destinations on the right, essentially says 
that in our linear programming formulation in Table I we must add the constraint of setting to 0 the 
demands of commodities such that either the source is located on the right or the sink is located on the 
left. But this constraint changes the result of the linear program only by a constant factor, and therefore 
asymptotically we get the same values from Table I and Table III. Intuitively, the reason is simple: 
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since nodes are uniformly distributed, we should have about n/2 nodes in S and about n/2 nodes in 
T with high probability; at the same time, since the source/destination pairs are uniformly distributed, 
about n/4 of the sources are placed on the left side of the network with destinations on the right side; 
therefore, by considering only traffic generated by sources in S for destinations only in T the value of 
the multicommodity problem should at most decrease by a factor of 4, and hence remains of the same 
order. To see this more formally, consider the following indicator variables: 



1, Si £ S A t i £ T 


0, otherwise, 

where S and T are given in Table III. Then, !^ n> = Ya =l is another random variable whose value is 

equal to the number of pairs with the source on the left half and the sink on the right half. We would like 
to compute how many are these pairs, to calculate the difference between the values of the two linear 
programs. To do this, we first compute the mean of I ljl K then we use the Chernoff bounds to prove a 
sharp concentration of this variable around its mean. 

We start by computing E(I( n ' ) ). We have: 


£(/M) 



E^ n) )’ 

i =1 


due to linearity of expectation. Now, 


E{l\ n) ) = i- P( Si e S AUeT) + o- P{ Si eTvueV) = P{ Si e 5 au £ T). 


Since the nodes in our network are uniformly and independently distributed, we have that the events 
{ .s t £ S} and {t, e T} are independent events, and therefore: 

E{l[ n) ) = P(si € S Ati £ T) = P( Si € 5 ) • P{U € T) = 1 

which finally gives us: 

n 

E(I^) = Y,E{l\ U) ) = 

2=1 

This expected number occurs with high probability as n —* oo because, according to the Chernoff bound, 

P[ |/ (n) -£(/ (ri) )| > 5E(I^ n) ) ] < e - 0E(/( ” ,) = e" 0 4 _> 0, 

as n —> oo. Thus, with high probability, there are about j sources in S with destinations in T. As a 
result, from the fairness constraint we have that P(d* n = A*/4) —> 1, and herefore, 0(A*) = (-)(£*,). 


March 20, 2005. 


DRAFT 


12 


C. Formulation of the Bounds 

With these tools, it is easy to describe asymptotic upper and lower bounds on A*: 

• To obtain an upper bound, we eliminate the multicommodity constraints from the linear program in 
Table III, and use the max-flow/min-cut theorem to compute the value of a maximum flow. Thus, 
we have that (-)(£*) < 0(;y*), for v n as defined in Table II. 

• To obtain a lower bound, we construct a feasible point for the linear program in Table III, to obtain 
a value 7 n for the LP for which, clearly, 7 n < t n . 

In the next two sections we evaluate these bounds, to show that 

9 (t/Jt) “ 7 " - c - eW) - e ( i 7r)- 

and thus conclude that ^* = 0 ^ lQ gT. n ^ ; and thus A* = 0 ^ lo ^ n j as well. 

III. Evaluation of the Upper Bound 

In this section, our goal is to show that z/* = 0 ^ l(l ^ 2 -" j, based on the methods outlined in the previous 
section. 

A. Counting Edges Across a Minimum Cut 

Fix a particular node on the left side of the minimum cut, L. The number of edges that cross the cut 
for that one node is exactly the number of nodes in the right side of the cut, R, that are within distance 
d n . Therefore, for an arbitrary point p = (x. y) in L = [-7 — d n . |) x [0,1], we draw a circle of radius 
d n and center (x, y). The points q = (u , v) in R, = [i, i + d n ] x [0,1] that are inside the circle are equal 
to the number of edges we want to count. These points p and q for which an edge exists satisfy the 
following conditions: (1 ) \ — d n < x < (2) either (a) 0 < y < 1, or (b) d n < y < 1 — d n \ (3) \ < u\ 

and (4) ( u — x) 2 + (y — y) 2 < d 2 . The situation is illustrated in Fig. 4. 



Fig. 4. To illustrate constraints on edges. 
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For each p = (x,y), we get the average number of points q = (u,v) within the shaded arc Q p in 
Fig. 4 using eqn. (3): E (Number of points in Q p ) = n\Q p \. 

To compute the area of Q p (denoted \Q P \), we let /) denote the angle of the arc illustrated in Fig. 4. 

0 i _Q i _ 

Then, it follows from elementary trigonometric identities that sin —, and so cos —. So, 

\Q P \ = 4??d 2 — \d n cos |2 d n sin | = \dd^ — sin i? = 4rf 2 (?? — sin??). And plugging this expression 
into n\Q p \, we get n\Q p \ = n^d 2 (?? — sin??). But from the trigonometric identities above, we have that 
r) = 2 arccos -^-4 and hence, sin r) = 2 sin | cos |, which implies sin 2 r) = 4 sin 2 | cos 2 |, which again 
implies sin 2 ?? = 4(1 — cos 2 |) cos 2 Now, since 0 < ?? < n, sin?? = 2cos |^/l — cos 2 | > 0, and 
so, finally, we get an expression for n\Q p \ in terms of n, d n , and the coordinates of the transmitter 


P = (x,y): 


n\Q p \ = -nd 2 (?? - sin??) 


— x 


I 2 X 

= nd„ arccos —-— 
1 d n 


— x 


0 , 

1 2 ?? 

2 V 

1 — COS 2 — 

1 

/I \ 2 \ 


(2 ~ X > 


dl 


The result above is the average number of edges that cross the cut, starting at a fixed point p = (x, y ) 
in L. To calculate the total number of edges S that cross the cut on average, we need to add up n\Q p \ 
over all nodes p, (i.e., compute S = )C p eL n IQpD- And our plan to do this is to approximate this sum 
by an integral. 

The value of \Q P \ is clearly dependent on the location of p: for p’s in L near the boundary of the 
cut (x <~ -j), ?? ~ 7 T and hence the shaded area is large; for p’s still in L but far from the boundary 
of the cut (x >~ _ ( ] n p -(? « 0 and hence the shaded area is small. Furthermore, except near the top 

and bottom boundaries, the area of Q p is independent of y. Therefore, to obtain a simple expression for 
the sought sum, our first step consists of dividing L into ^ thin strips of height 1 and width A (for 
A -C dn), and expanding Yl P eL n \Qp\ ' n two different ways: 

dn/ A 

So. = n\Q X y\ ■ \{p = (x,y) € L : 0 < y < 1}|; 

"-r-- 

dn/ A 

Sb = yz n\Q X y\ ■ \{p = (x,y) G L : d n < y < 1 - d n }\; 

---- 

(in both cases, we take 4 — d n + (k — 1 )A < x < | — d n + kA). S a is an upper bound on J2 p cl n \Qp\> 
since we may count edges that end up outside the network; Sb is a lower bound, since we may not count 
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some valid edges close to the network boundary; blit as long as d n —> 0 as n —» oo, both bounds become 
tight and equal to Yj P &L n \Qp\- 

The next step is to observe that once again we can approximate the size estimates s a and s b using 
eqn. (3): s a = nA and s b = n(l — 2d n )A. Hence we get: 


xy\ 



d n /A 

d n /A 



■ nA = n 2 A |Q. 


fe=l 

fc=i 


- n 2 P 

/ | Qiy | dxdy, 


Jx=\-d n 

dn/A 

Jy= o 

5 6 

k= 1 

• n(l — 2d n )A = n 2 \ 


r\ 

rl—d n 


« n 2 / 



Jx=\-d n 

Jy=d n 


dn/A 


k =1 


!xy\ 


since A Y^k=t \Q%y\ * s a Riemann sum that, as we let A —» 0, converges to the integral over an 
appropriate region of \Q P \. 

And now we are almost done. Since S b < Yl p eL n \Q P \ A S a , and we have that for n large, S a ~ 
S b « n 2 f L \Q p \, we finally get: 

,2 


^ ] n \Q P \ 

p£L 


n 


Jl \ Q r 


dp 


= n 


2 j2 

= n cL 


/■ U 

■!\-d„ JO 

/* f 

J TT—dn J 0 


arccos 


2 x 2 X 


'1- 


(i-x) 2 


d 2 


dydx 


l 


arccos —-— dydx — n 2 d 2 / / 

d n di-d n do 


t i 


a -*) 2 


dr 


1- 


dl 


dydx 


1 — X _ n .r, f 2 ^ — X 


n 2 d 2 / arccos — dx — n 2 d 2 . 

d±-d» d n Jl-d n d n 

( a ) /*0 /*0 

— — n 2 c£ / arccos u du + n 2 


/ (i — x) 2 


J arccos u du + n 2 J u\f 1 

n 2 d^ / arccos u du — n 2 / nyl 
do do 


— n 2 dn 


— n 2 dn 


n 2 dl - \n 2 d? n = §n 2 4, 


where (a) follows from the change of variable 


March 20, 2005. 


DRAFT 


















15 


B. Sharp Concentration Results 

Our next goal is to show that the actual number of edges straddling the cut in any realization of the 
network is sharply concentrated around its mean. That is, in almost all networks, the number of edges 


across the cut is 0(ra 2 d^) = ®{y/n logs (n)). 


Define a binary random variable Nij, which takes the value 1 if the /-th node is within the transmission 
range of a node at coordinates (x 3 . y 3 ) on the other side of the cut, as illustrated in Fig. 4: 

1 , Xi £ Q{ Xj ,yj) 



0, otherwise. 


Let p denote the probability that Xi is in Q( Xjjyj ) (i.e., that N l3 = 1). Then, p = \Q( Xjtyj ) = ^d‘ 2 n (i) — 
sin(i?)), with 0 < i3 < tt is as in Fig. 4. Therefore, defining as — sin(i?)), we havep = | Q( x y )\ = 



Define Nj = Y^= t ^ij as the number of points in Q( Xj , yj )- In this case, we have E(Nj ) = E( N ij ) 
= ^^= 1 P ' I + (1 — p) • 0 = np = log(n). Now, by eqn. (4), we have that 

P(\Nj — log(n)| > 5k,0 log(n)) < e ~ eK ^ los< ' n ' > = 

As n —> oo this probability tends to zero, and therefore, in almost all network realizations, a node on 
the left side of the cut is connected to 0(log(n)) nodes on the right side. 1 By an analogous argument, 
we have @(nd n ) = 0 (\/n log n) nodes on the left half. Therefore, the actual number of edges across 
the cut is 0(log 2 n) • 0(-^/nlog n), so no* = ©(yTilog^ n). 


IV. Evaluation of the Lower Bound 


To give a lower bound for l* n , we construct one feasible point: this is accomplished by giving a specific 
routing algorithm, and finding how much traffic this scheme can carry: 

1) We start by proving that, with probability 1 as n —> oo, there is a subgraph of the random graphs 
under consideration with a clear, regular structure. 

2) We then develop a (very simple) routing technique that makes use of the links in the regular 
subgraph only. 

3) Finally, we determine the throughput achieved in this way. 

’Observe that k& = 0 only over a set of measure zero (the set of network locations such that x = ^ — d n ), and thus the 
exponent can be assumed strictly positive. 
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A. Existence of a Regular Subgraph 

Consider a partition of the network area (the closed set [0,1] x [0,1]) into square cells, each one of 
area t d determine c, we observe that the side of a cell is \J ' " , and so c is chosen such that 

the inverse of this number, yj cl ™ gn , is an integer - this is to guarantee that the cells form a partition of 
the whole network, as illustrated in Fig. 5. 




Fig. 5. To illustrate the presence of a structured subgraph for large n. Consider the shaded center cell: all nodes 
within that cell are connected by an edge to every node in the cells above, below, left and right. To guarantee that 
all such edges can be formed, from Pithagoras, the transmission range d n must be chosen as d n = yj 5c *° s n . So, 
provided c > 0, connectivity of the network is guaranteed [13]. 

With the construction of grid and choice of connectivity radius d n shown in Fig. 5, each node within 
a cell will have an edge connecting it to all nodes in the four adjacent cells. 

B. A Routing Algorithm 

To describe the routing algorithm, we define first some notation: 

• Given a cell (i,j) (1 <i,3 < i^)> Vij denotes any arbitrary node v £ V contained in that cell. 

• Given a node (Uy).j(v)) denotes the cell that contains v. 

• v cun : current node; n dest : destination node. 

The algorithm executed at each node to decide the next hop of a message is as follows: 

1) If j(v can ) < j(n dest ), sen d message to (i(v curr ), j(v cun ) + 1). 

2) Else, if i(v curT ) > z(n dest ), send message to (z(n curr ) — l,j(v curT )). 

3) Else, if i('u curr ) < i(n dest ), send message to (i(n curr ) + 1, j(v cm )). 
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4) Else, and u dest are in the same cell (so u dest is reachable in one hop from v cu ”), hence stop. 
These mechanics are illustrated in Fig. 6. 



Fig. 6. To illustrate routing mechanics. A source in a cell left of the center cut sends messages to a destination on 
the right side by first forwarding data horizontally, then vertically. Under the assumption that all cells are non-empty, 
there is always a next hop. 


For this algorithm to work properly, we must insure that no cells are empty: if this condition holds, 
then we can be sure that an F-shaped path as shown in Fig. 6 will always deliver packets to destination. 
But the fact that no cells are empty is not obvious, and requires proof. In fact, we will prove something 
stronger: the number of nodes contained in any arbitrary cell is ©(log n), with high probability as n —> oo. 
Define X, t j as the number of nodes within a cell (i. j) (1 <i,j< yp Then, 

• Mean value of X t j\ 

E(Xij) = n • = clogn. (5) 

n 

• Chernoff bound on deviations from the mean for X V] : 


P{\Xij — clogn) > <5clogn) < e 0clogn = 

n c0 

Probability that the occupancy of none of the cells deviates significantly from its mean: 

P P| | Xij — clogn| < 5c\ogn I = 1 — P MJ | X l3 — clogn| > 5c log 


n 


1,3 




> 1 — P (| — c log n\ > 5c log n). 


hj 


Consider now any e > 0; there is a value no(e) such that, for all n > no(e), 

1 (a) 1 1 


^ P(\Xij — clogn) > 5c\ogn) < ^ 


n' 


c6 


h3 


h3 


c n ce 1 log n 


< e, 
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where (a) follows from the fact that there are cells, and provided c> i. Therefore, 


P P| |Xij — clogn[ < 6c log n I > 1 — e, 


. *<■? 


and thus all cells contain 0(logn) nodes almost surely, as n —> oo. 

With this, we see that all cells simultaneously will be non-empty in almost all networks. Thus, the 
routes defined by the proposed routing algorithm will always deliver data to destination. We still need to 
determine how much though. 


C. Computation of the Achievable Throughput 

The last step is to determine the throughput available to a source/destination pair constructed by the 
routing algorithm above. 

We start by stating the relatively straightforward fact that, since the routes constructed by the algorithm 
above do not split the flow at any intermediate node, the throughput of a connection is determined by the 
capacity available to that connection at the link with highest load. 2 Now, since links have a fixed finite 
capacity, and since in our problem we work under a fairness constraint that forces all source/destination 
pairs to inject the same amount of data, we have that the capacity allocated to a commodity on any 
link is —-— raw lmk capacily ——. Thus, our problem reduces to finding the maximum number of commodities 

# of commodities using that link A 

sharing a link. 

We claim that no link in the network is shared by more connections than the links which straddle the 
center cut: 

• The number of commodities sharing a link across the center cut is exactly equal to the number of 
nodes within a horizontal strip, as illustrated in Fig. 7. 



Fig. 7. Each node left of the center cut generates traffic that must cross that cut. 


2 This intuitive fact can be formalized based on Robacker’s decomposition theorem for multiflows [24], [26]. 
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• Clearly, the number of commodities on horizontal links (meaning, links going from one cell to 
another cell either left or right) decreases as we move away from the center cut: 

- Moving left, the number of sources decreases. 

- Moving right, once a connection reaches the column on which the cell containing its destination 
lies, it starts moving along vertical links and never goes back to horizontal ones. 

• The number of commodities on vertical links is at most the same as the number on links in the 
center cut - but this requires proof. 

To prove this last point, we need to count the number of commodities sharing a horizontal link crossing 
the center cut, and we have to give an upper bound on the number of commodities sharing an arbitrary 
vertical link. 

In terms of the number of commodities sharing a horizontal link across the center cut: 

• The average number of commodities across the center cut is just 

1 


n 




and again from the Chernoff bounds, we have that this is not only the ensemble average, but that 
in almost all networks, this number is 0 (y/n log n). 

By a similar argument, we have that the number of edges across the center cut in between two 
adjacent cells is 0(log 2 n) - with high probability, 0(logn) nodes in each cell, by construction 
there is a link between any two of those. 

y/n 


Thus, the number of commodities sharing a link across the center cut is 


= 0 


log 2 (n) 


In terms of the number of commodities sharing any vertical link, an upper bound on this number 
is given by # !nLfatLreen V a^ac!nrceii s • Why this is an upper bound is illustrated in Fig. 8. But then, since 
the number of nodes in a vertical strip is twice the number of nodes in -y of a horizontal strip, from 
an argument entirely analogous to the count of commodities sharing a link across a center cut in the 
paragraph above, we have that the number of commodities sharing a vertical link is at most 0 


x/re 


In summary, we have that the link sharing the largest number of commodities is shared by 0 


log 2 (n) 
\fn 

log2(n) 


of them. Therefore, by the fairness constraint, the capacity of this link is shared equally among all 
commodities, and thus this capacity is the sought 7 n value, i.e., 7 n = 0 j s achievable for the 

linear program in Table III. 
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Fig. 8. Upper bound on the number of commodities sharing a vertical link. Clearly, not all commodities that use 
a vertical link share a link across the thick horizontal line: some will switch from horizontal to vertical above 
the line and reach their destination before crossing that line, and the same will happen below. By estimating the 
number of commodities sharing a vertical link by the number of nodes in a vertical strip, we effectively say that 
all commodities in that strip use all vertical edges. This is certainly an overestimate, based on which we obtain 
only an upper bound. 


D. Remark 

Note: the “spirit” of this proof is very similar to the proof in [14, Sec. IV]: in both cases, the goal is to 
give an explicit construction to show the achievability of certain throughput values. However, the methods 
employed to analyze the throughput achieved by the routing strategies proposed differ significantly - [14] 
relies heavily on VC theory [31], whereas we only use properties of flows and the Chernoff bound. 

V. Applications to Wireless Networking Problems I: the Gupta-Kumar Setup 

Before considering more general node architectures in Section VI, we show in this section how, for 
the case of nodes equipped with omnidirectional antennas, using our proof techniques we obtain scaling 
laws identical to those reported in [14], but under strong convergence. 

A. Transmitter/Receiver Model 

In [14], transmissions were omnidirectional, and described based on a pure collision model: for a 
transmission to be successfully decoded, no other transmission has to be in progress within the range of 
the receiver under consideration. This setup is illustrated in Fig. 9. 
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Fig. 9. A transmission model based on omnidirectional antennas and pure collisions. 


B. Average Number of Edges Across the Cut 

Our first task is to determine the average number of edges that can be simultaneously supported across 
the cut, average taken over all possible network realizations. 

1) An Upper Bound: For a fixed receiver location (x, y ) in R, there can only be one active transmitter 
within distance d n of the receiver, for that transmission to be successfully received. Since to obtain an 
upper bound we only need worry about edges that cross the cut, we first consider all possible locations 
of one such transmitter in L, by drawing a circle of radius d n and center (x, y). This region is illustrated 
in Fig. 10. 


d d 



Fig. 10. For a receiver at location (x, y), at most one transmitter in the shaded region T xy can send a message (if 
this message is to be successfully decoded on the other side of the cut). 

Denoting by \T xy \ the area of the shaded region T xy in Fig. 10, we use eqn. (3) to estimate the number 

of transmitters located in T xy as n\T xy \. Flowever, since only one transmitter located within T xy can 

transmit successfully to a receiver at (x,y), the number of nodes that are able to transmit at the same 

time from L to R is upper bounded by 

E (Number of nodes in L ) riL 

E (Number of nodes in T xy ) nT xy 

This is an upper bound, because we are assuming that it is possible to find a set of locations (x, y) in 
R such that no area in L is wasted—showing that this bound is indeed tight requires proof. 
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Now, the area of L is d n . To compute the area of T xy , we have to determine the area of an arc 
of a circle with angle ??, as shown in Fig. 10, and in a computation entirely analogous to that of the 
calculation of \Q p \ in Section VI-C. In this case, we have that sin(^(7r — ??)) = = cos(^$), and 

since | < x < \ + d n it is clear that we must have 0 < ?? < ir and also sin ?? > 0. Then, we get 
\T xy \ = \dd^ — \d n cos(\d)2d n sin(|??) = \dd\ — ^d^sind, and therefore, \T xy \ = \d\(d — sin??). 
Hence, for each possible value of d, an upper bound on the number of nodes that are able to transmit at 
the same time from L to R is 

nL nd n 2 

nT xy n\d^ l {d — sin??) d n (d — sin d) 

Since this upper bound depends on the choice of receiver location (through the angle d), we will make 
this bound as small as possible by an appropriate choice of d. As noted above, 0 < d < it, and sin d > 0. 
Hence, the number of transmitters in L is smallest when d = n and sin?? = 0, i.e., when the receivers 
are located close to the cut boundary (as it should be, since it is in this case when receivers “consume” 
the maximum amount of transmitter area). In this case, we get 

2 

ird n 

as an upper bound on the number of edges across the cut. Furthermore, in this case we see immediately 
that to maximize capacity we must keep d n as small as possible—and we know from eqn. (1) that the 
smallest possible d n that will still maintain the network connected is 0 (y 7 log n/n). Therefore, replacing 
for the optimal d n , we finally get an upper bound of 0 (\Jnj log n j. 

2) The Upper Bound is Asymptotically Tight: To verify that the upper bound is tight, we give an 
explicit flow construction. Consider the placement of disks shown in Fig. 11. 




mm 

0<'l9<7T 


d n (d — sin??) 


Fig. 11. An explicit flow construction. 

Since the height of the square is 1, and we are placing nodes at distance 2 d n from each other, this 
guarantees that if there are nodes in each of the circles to create valid tx/rx pairs , then the number of 
successful simultaneous transmissions across the cut is = 0 ^ yjn / log nj . Whether all such pairs of 
nodes can be created simultaneously or not is the issue addressed next. 
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C. Uniform Convergence Issues 

Next we prove that when n points are dropped uniformly over the square [0,1] x [0,1], we have 
that simultaneously (i.e., uniformly) over all circles from Fig. 11, each one of the circles contains 
0(log(n)) points in almost all network realizations. From this, we conclude that the distribution of the 
number of edges across the cut is sharply concentrated around its mean, and hence that in a randomly 
chosen network, with probability approaching 1 as n —> oo, the actual number of straddling edges is 
indeed 0 ^ \Jn/ log(n)^. 

1) Statement of the Result: Consider we have circles centered along the x = \ cut as shown in 
Fig. 11, with centers yj = (2 j — 1 )d n , j = 1... -jf- and radius d n . Then, we have the following uniform 
convergence result: 

Proposition 1: Define Bj := \\Nj — 7rlogn| < Stt log n] . Then, as n —> oo, and for any 6 E (x, 1) 
(x ~ 0.6), we have that 

lim P 

n—>oo 



Essentially what this proposition says is that with very high probability and uniformly over j, all Af s 
contain 0(logn) nodes. 

2) Proof: Note that the area of a circle in Fig. 11 is it tf} n = 7r^p. Then, from the Chernoff bound, 
we have that for any 0 < 5 < 1 we can find a 9 > 0 such that 

P[\Nj — 7rlogn| > <57rlogn] < e - enlo & n = n ~ drt . (6) 


Thus, we can conclude that the probability that the number of nodes in a circle deviates by more than a 
constant factor from the mean tends to zero as n —* oo. This is a key step in showing that all the events 
Bj ■■= [|JVj — 7rlog(n)| < ^7rlog(n)] occur simultaneously. Now, from the union bound, we have that 



- 1 

2 d n 


- 1 

2 dfi 

p 

1_ 

= 1 -p 

U B i 

3 = 1 


But, from eqn. (6), P[Bj] < n 6n , and therefore, 


1 

2 d n 




3 = 1 


—Qti 


E p i s fi < E 1 * 

j= 1 i =1 

Putting everything together, and letting n —> oo, we have 


n 


—On 


2 d r 


n2 n6 

2-y/log n 


P 


3 = 1 


> 1- 


71 2 


— 7 tO 


2 v /Iog n 
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if and only if n8 >*■ And this is true for 5 « 0.6 and above (this follows from the definition of 8 and 
a simple numerical evaluation). 

VI. Applications to Wireless Networking Problems II: Directional Antennas 
A. On Directional Antennas and MST Issues 

We consider now an application of the techniques that were used so far to analyze the network capacity 
problem in the context of directional antennas. 

Why the interest in directional antennas? Because there is a question about wireless networks equipped 
with such antennas which we believe is very important, and for which we could not find a satisfactory 
answer in the literature. We discussed in Section V the vanishing throughput problem identified in [14]. 
But in a different segment of the research community, the use of directional antennas has also received a 
fair amount of attention in recent times. The rationale is that with omnidirectional antennas, existing MAC 
protocols require all nodes in the vicinity of a transmission to remain silent. With directional antennas 
however, it should be possible to achieve higher overall throughput, by means of a higher degree of 
spatial reuse of the shared medium, and a smaller number of hops visited by a packet on its way to 
destination (see, e.g., [4]). Furthermore, in the context of energy-efficient broadcast/multicast, it has been 
argued that the ability of a transmitter to reach multiple receivers is an important source of gains to take 
advantage of in the development of suitable protocols, such as BIP [32]. 

If we take a step back, careful reading of these previous results raises an important question: how 
much exactly is there to gain from the use of directional antennas? Could directional antennas (in which 
the width of the beams tends to zero as n gets large) be used to effectively overcome the vanishing 
maximum throughput of [14]? Although we have not been able to find answers to this question in the 
literature (and that motivated us to start working on this problem in the first place), we have found a 
couple of related results based on which we can say a-priori that the answer is probably no: 

• In [14], the authors claim that their result holds irrespective of whether transmissions are omni¬ 
directional or directed, provided that in the case of directed antennas there is some lower bound 
(independent of network size) on how narrow the beams can be made. 

• In [21], [27], for some regular networks, it is shown that enabling nodes with Multi-Packet Reception 
(MPR) capabilities [9] can only increase the total throughput of the network by a constant factor 
(~ 1.6), independent of network size. 

Given this state of affairs, it seems to us that deciding exactly how much there is to be gained by using 
directional antennas, and giving some measure of how complex the transmitters/receivers need to be made 
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to achieve those gains, is indeed a topic worth being studied. 

B. A Single Directed Beam 

1) Transmitter/Receiver Model: In this section we consider the first model based on directional 
antennas: transmitters can generate a beam of arbitrarily narrow width aimed at any particular receiver, 
and receivers can accept any number of incoming messages, provided the transmitters are not in the same 
straight line. This results in a significant increase in the complexity of the signal processing algorithms 
required at each node, and in this section our goal is to determine if and how much it is possible to 
increase the achievable MST, compared to the omnidirectional case. This model is illustrated in Fig. 12. 



Fig. 12. A single beam model for communication between nodes. 

Our goal in this subsection is to evaluate 0(z/*) and 0 ( 7 n ), for this particular architecture. 

2) Average Number of Edges Across the Cut: Since at most one edge per transmitter can be active at 
any point in time, the average number of edges going across the cut can be no larger than nd n , the average 
number of transmitters on its left side. Since L and R have the same area, the average number of nodes 
on each side of the cut is the same (and equal to ndn), and hence the maximum of nd n transmissions 
can actually be received, by “pairing up” every node from one side of the cut with every node on the 
other side. The pairing of nodes on each side of the cut is illustrated in Fig. 13. 



Fig. 13. Pairing up one transmitter in L with one receiver in R\ at most n\L\ = n\R\ = nd n such pairs can be 
formed. 

Finally we note that, under the assumption of arbitrarily narrow and perfectly aligned beams, the only 
way in which we could have multiple receivers blocked out by a single transmission is by having them all 


March 20, 2005. 


DRAFT 










26 


lying in a nearly straight line (i.e., a set of vanishing measure) under the beam of a single transmitter. But 
then, to have an actual edge count lower than Q(nd n ), we would require an increasingly large number 
of nodes falling in a decreasingly small area: under our statistical model for node placement, this event 
occurs with vanishing probability, and therefore the average edge count is Q(nd n ). 

3) Sharp Concentration Results: 

a) Number of Transmitters in L and Receivers in R: Again, consider n points X\...X n uniformly 
distributed over the [0,1] x [0,1] plane, and consider the area L on the left side of the cut, as shown in 
Fig. 13. We define variables 


1, Xi G L 

Ni = < 

0, otherwise. 

and N = t The probability p of Xi E L is p = \L\ = 1 • d n . Hence, E{Ni) = 1 - p + 0 ■ (1 — p) = 
p = dn, and E(N ) = E(Ni) = nd n . From the Chernoff bound, we know that 


P(\N - nd n | > 5nd n ) < e~ 9nd X 


Since 6 > 0, we have that as n —> oo, deviations of N from its mean by a constant fraction (independent 
of n) occur with low probability, provided d n does not decay too fast. 3 Therefore, we conclude that in 
almost all realizations of the network, the number of transmitters in L and the number of receivers in R 

is Q{nd n ). 

b) Number of Edges Across the Cut: Knowing that we have Q{nd n ) transmitters and receivers 
within range of each other on each side of the cut is not enough to claim that the number of edges that 
cross the cut is Q(nd n ). This is because, in our model for directional antennas, a receiver can successfully 
decode two simultaneous incoming transmissions provided the angle formed by the receiver and the two 
transmitters is strictly positive: if all three are on the same straight line, collisions still occur, and those 
edges are destroyed. Therefore, we still need to show that the actual number of edges is @(nd n ). And to 
do this, we need to say something about the location of points that end up in L, and not just count how 
many. To proceed, we cut the area of L into nd n rectangles of height and width d n , as illustrated in 
Fig. 14. Our goal then becomes to show that in “most” of these rectangles (meaning, in all but a constant 
fraction of them) we will have nodes capable of forming straddling edges. 


’Note that the fastest possible decay for d n , according to eq. (1), is when d n « y e ^ n . And in this case, e~ 0ndn = 
g-e^cniogn _> o as n ^ oo. If d n is any bigger, this probability goes to zero even faster. So the Chernoff bound applies for 
any connected network. 
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d d 



Fig. 14. Cutting L and R into rectangles of size d„ x -A—. 


Counting how many of the nd n rectangles in Fig. 14 contain at least one of the Q{nd n ) nodes that are 
dropped in L is an instance of a classical occupancy problem, in which k balls are thrown uniformly onto 
m bins, in the case where k = m = nd n [22, Ch. 4], Since ^ is the probability that a ball falls in any 
particular bin, the probability p of an empty bin after throwing all m balls is p = (1 — ^) m which, for 
m large, becomes approximately Therefore, the average number of empty bins is mp ~ ^ \f n log(n). 
And by the Chernoff bound, again we have that 

P(Y- ndje > 5nd n /e ) < e - 6nd ^ e , 


where Y is the number of empty bins. So, the probability that the number of empty bins is a constant 
factor away from its mean is small (again, provided d n does not decay too fast), and hence, for n large, 
almost all network realizations will have Q(nd n ) non-empty rectangles. But since transmitter/receiver 
pairs in different rectangles are not collinear, the number of edges across the cut is Q(nd n ), qed. 

4) Remarks: 

a) MST in a Minimally Connected Network: In this section, we found that the MST achievable 
by the type of tx/rx pairs considered here depends on the connectivity radius d n . If we replace d n with 
/ c.logn ^ t [ lc m i n j mum radius required to maintain a connected network, from [13]), we get 


nd r , 


n 


clogn 


n 


= 0 


n log n 


Comparing this expression with its equivalent from Section V, we see that all we gain over the case of 
omnidirectional antennas is an increase in MST by a factor of 0(logn). 

b) Minimum Connectivity Radius Resuting in MST = 0(n); In this tx/rx architecture we are 
considering the use of arbitrarily narrow and perfectly aligned directed beams. Therefore, it does make 
sense to consider the use of a possibly larger transmission range than the minimum required to keep the 
network connected, since in this case a large range does not force other tx/rx pairs to remain silent while 
a given transmission is in progress. And since by increasing the transmission range now we can increase 
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throughput, our next goal is to determine the minimum range that would be required to achieve MST = 

0(n). 

Solving for d n in 0(n) = Q{nd n ), we see that trivially, d n = 0(1). That is, to achieve MST linear 
in the number of nodes using a single beam in each transmission, the radius of each transmission has to 
be a constant independent of n. 

c) Minimum Number of Simultaneous Beams: From a practical point of view, does it matter that to 
achieve linear MST we need to keep the transmission radius constant? In this section we argue that yes 
it does, very much. To see why this is so, next we count the minimum number /3 of narrow beams that 
a transmitter would have to generate simultaneously, if MST linear in the size of the network is to be 
achieved: this number gives a measure of the complexity of the beamforming transmitter, since 2tt//3 is 
an upper bound on the maximum angle of dispersion of the beam. 

Since a node can generate a beam to any receiver within its transmission range (see Fig. 12), again 
using eqns. (3) and (4), we have that for n large, the number of points within a circle of radius d n is 
0(n • 7T d^f). In the case of d n only satisfying the requirement of keeping the network connected, 

[3 = n-ttdl = n ( j Cl ° gU ^J = 0(logn). 

This fact was known already—see [34] for a more complete analysis (constants hidden by the 0-notation 
included), including also a number of interesting references on the history of this problem. But if now 
we consider a larger d n satisfying the requirement of achieving linear MST, then 

p = n ■ 0(1) 2 = 0(n). 

Therefore, we see 3 has an exponential increase relative to the number required to maintain minimum 
connectivity—it is on this fact that we base our claim about directional antennas not being able to provide 
an effective means of overcoming the issue with per-node vanishing throughputs. 

C. Multiple Directed Beams 

1) Transmitter/Receiver Model: In this section we consider another model based on directional anten¬ 
nas: transmitters can generate an arbitrary number of beams, of arbitrarily narrow width, aimed at any 
particular receiver; and receivers can accept any number of incoming messages, provided the transmitters 
are not in the same straight line. This is perhaps the most complex scheme that could be envisioned based 
on directed beams. Our goal is to determine if and how much it is possible to increase the achievable 
MST, compared to the previous two cases. This model is illustrated in Fig. 15. 

Again, our goal in this subsection is to evaluate ©(a*) and 0 (y n ), for this particular architecture. 
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Fig. 15. A non-degraded broadcast channel model for communication between nodes: each node is able to send 


simultaneously a different packet to each one of the nodes within his transmission range. Furthermore, multiple 
broadcasts (from different transmitters) do not collide, unless the transmitters are perfectly aligned. 


2) Average Number of Edges Across the Cut: With minor variations, this calculation is essentially 


identical to that presented in Section III-A, and the final result is the same: the ensemble average number 
of edges straddling the center cut is (-)(n 2 df). See Section III-A for details. 

3) Sharp Concentration Results: Our next goal is to show that the actual number of edges straddling 
the cut in any realization of the network is sharply concentrated around its mean. That is, in almost all 
networks, the number of edges across the cut is 0(n 2 c^), 

a) Number of Receivers per Transmitter: Define a binary random variable N lJy which takes the 
value 1 if the i-th node is within the transmission range of a node at coordinates ( Xj,yj ) on the other 
side of the cut, as illustrated in Fig. 4: 



1 , Xi € Q{ Xj , yj ) 

0, otherwise. 


Let p denote the probability that X t is in Q( Xj , yj ) (i.e., that = 1). Then, p = \ Q( Xj , yj )\ = ^d 2 (r) — 
sin(i?)), with 0 < i? < n is as in Fig. 4. Therefore, defining k& as sin($)), we havep = \Q( Xj , yj )\ = 



Define Nj = Y^i=i ^vi as l ^ lc number of points in Q( Xj ,y j )- In this case, we have E(Nj) = XT=t = 

P ' 1 + (I — p) ' 0 = rip = log(n). Now, again from the Chernoff bound, we have that 

P(\Nj — K^log(n)| > <fctflog(n)) < e ~ 0K " l>log ( n ) = 

for 6 defined as in previous applications. As n —> oo this probability tends to zero, and therefore, in 
almost all network realizations, a transmitter on the left side of the cut will be able to reach 0(log(n)) 
receivers on the right side. 

b) Total Number of Edges: In a manner analogous to the situation discussed in Section VI-B, 
knowing that there are Q{nd n ) transmitters on the left side of the cut, and that each transmitter can reach 


March 20, 2005. 


DRAFT 





30 


0 (nd 2 ) receivers on the other side, is not enough to conclude that the total number of edges going across 
the cut must be 0(n 2 d^). This is because of our requirement that multiple transmitters not be perfectly 
aligned with a receiver for this receiver to decode all these messages simultaneously. Therefore, we still 
need to show that the actual number of edges is 0(n 2 d^). And to do this, we need to say something 
about the location of points in R that can be reached from L, and not just count how many. To proceed 
then, we cut the area of Q p into /c# log(n) slices, each slice of area = ^, as illustrated in Fig. 16. 



Fig. 16. Cutting the shaded arc Q xy into regions of area to formulate this as an occupancy problem analogous 
to that of Fig. 14. 

As in the occupancy problem considered in Section VI-B, our goal is to show that in “most” of 
these arc slices (most meaning, in all but a constant fraction of them) we will have nodes capable of 
forming straddling edges. This is again a problem of throwing k balls uniformly into m bins, where 
k = m = Kfl log(n). And again, we have that with probability that tends to I as n oo, the number of 
empty bins is K$\og(n) / e, and hence the number of occupied bins is 0(log(n)). 

Consider now a fixed transmitter located at some coordinates (x,y). Any other transmitter located at 
coordinates (x',y') / (x,y) defines a unique straight line that goes through (x,y) and (x',y'). If there 
is a receiver on the other side of the cut along this line, within reach of both transmitters, then those two 
edges will be lost—and those will be the only lost edges, from among the k,/ log(n) that each transmitter 
has. This situation is illustrated in Fig. 17. 


d d 



Fig. 17. To illustrate how we could end up losing edges: if the two black transmitters attempt simultaneously to 
communicate with the gray receiver, a collision will occur, and none of the edges will be created. 
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And then we are done. We have established that in almost all network realizations, there are Q{nd n ) 
transmitters within each side of the cut, that each transmitter can reach 0(cZ 2 ) receivers on the other side 
of the cut, and that integrating out we obtain exactly (-){n 2 d'f) edges going across the cut. Therefore, 
the actual number of edges across the cut is sharply concentrated around its mean, qed. 

4) Remarks: 

a) MST in a Minimally Connected Network: Substituting for d n = y c *° g n in | n 2 d^, we get 

3 

|n 2 ^ 1 °^ U ^ = |\/™log 2 n = 0 

Comparing this expression to the ones obtained in Sections V and VI-B, we see that the MST gain 
due to the use of multiple simultaneous, arbitrarily narrow beam is, at most, 0 (log 2 (n)). 

b) Minimum Connectivity Radius Resuting in MST - 0(n): The minimum d n resulting in linear 
MST is obtained by solving for d n in G(;n 2 d 2 ) = 0(n). Now, for n large enough, there exist constants 

Cl < c 2 G M (Cl > 0 and c 2 < oo), such that cin < | n 2 e^ < c 2 n, or equivalently, cifn _ 3 < d n < 
c 2 |n _ 3. Therefore, 

d n = 0(n“ 3 ). 

c) Minimum Number of Simultaneous Beams: In Section VI-B, we said that keeping the transmission 
range constant resulted in an impractically large number of beams that the receiver needed to generate, 
if linear MST was to be achieved by increasing the complexity of the signal processing algorithms. 
But if we generate multiple beams, we have just shown that this minimum radius now is no longer a 
constant, but instead tends to zero as 0(n - 3). However, the situation is not much better compared to the 
single beam case, and to see this again we compute the minimum number of simultaneous beams that a 
transmitter would have to generate. 

If now we consider the larger d n satisfying the requirement of achieving maximum stable throughput 
linear in network size, then 

/3 = n ■ 7rd 2 = n0(n _ 3 ) = 0 ( 77 , 3 ). 

Therefore, we see that while (3 is smaller than in the case of the single beam, we still have an exponential 
increase relative to the number required to maintain minimum connectivity—so again, we claim that 
directional antennas are not able to provide an effective means of overcoming the issue with per-node 
vanishing throughputs. 
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VII. Conclusions 

A. Summary of Contributions 

In this paper, we have showed how network flow methods can be used to determine (to within constants) 
the maximum stable throughput achievable in a wireless network. This was done by formulating MST 
as a maximum multicommodity flow problem, for which tight upper and lower bounds were found. In 
the process, the difficult multicommodity problem was proved equivalent to a simpler single commodity 
problem, solvable using standard arguments based on flows and cuts. 

As mentioned in the Introduction, this work grows out of our desire to cast what we deem to be the 
most useful insights in [14] (basically, that the constriction in capacity results from the need to share 
constant capacity links by a growing number of nodes), in a form that makes more intuitive sense to us. 4 
And we feel we have accomplished that: 

• By reducing the problem to counting the average number of edges that cross a cut and then proving a 
sharp concentration result around this mean, the computational task becomes very simple, involving 
only elementary tools from combinatorics and discrete probability. In [14], similar results had been 
obtained based essentially on generalizations of the Glivenko-Cantelli theorem (that add uniformity 
to convergence in the law of large numbers), due to Vapnik and Chervonenkis [25, Ch. 2], [31]. 

• In our formulation, it is straightforward to see that capacity limitations arise essentially from the 
geometry of the problem: edges have a constant capacity, and only about ^Jn of them are available 
at a minimum cut to transport the traffic generated by n sources. 

B. Future Work 

In terms of future work, there are a number of interesting questions opened up by this work. One 
deals with the generalization of these results to nodes distributed on arbitrary manifolds (instead of 
the square [0,1] x [0,1] C M 2 ). Another deals with exploring other combinatorial structures (such as 
hypergraphs [3]), to develop better collision models, especially in the omnidirectional case. Of particular 
interest to us however is the development of a purely information-theoretic formulation for the results 
in this paper, by exploiting the connections between Shannon information and network flow theory 
discovered in [2]. 

4 However, we certainly would have not been able to obtain our simpler and more general proofs without the insights provided 
by cultivating an appreciation for the line of reasoning employed by Gupta and Kumar in [14], 
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